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|We bound the rate of convergence to uniformity for certain random walks on the 
complete monomial groups G I Sn for any group G. These results provide rates of 
convergence for random walks on a number of groups of interest: the hyperoctahedral 
group Z2 ; Sn, the generalized symmetric group Zm I Sn, and Sm I Sn- These results 
provide benchmarks to which many other random walks, modeling a wide range of 
phenomena, may be compared using the comparison technique, thereby yielding bounds 
on the rates of convergence to uniformity for previously intractable random walks. 

1. Introduction. How many steps does it take for a deck of n cards to achieve near- 
randomness if, at each step, two randomly chosen cards are transposed? This question was 
answered by Diaconis and Shahshahani (1981). Now suppose that at each step, the two cards 
are not only transposed, but also possibly flipped. Suppose that, instead of n cards, there 
are n wheels and that, at each step, two wheels are transposed and then spun. Or suppose 
that, instead of n wheels, there are n decks of cards and that, at each step, two decks are 
transposed and then shuffled. How many steps does it take for these processes to achieve 
near-randomness? 

Or perhaps we are interested in a process that is (at least somewhat) similar in form to 
one of those described above. Can we determine how many steps are needed for it to achieve 
near-randomness? These are the types of questions that we consider. 

For a certain random walk on the symmetric group Sn that is generated by random trans- 
positions, Diaconis and Shahshahani (1981) obtained bounds on the rate of convergence to 
uniformity using group representation theory. Similarly, we bound the rate of convergence 
to uniformity for a random walk on the complete monomial group G I Sn that is gener- 
ated by random transpositions, followed by independent randomizations of the transposed 
elements. Specifically, we determine that |nlogn + ^nlogdCI — 1|) steps are both necessary 
and sufflcient for i"^ distance to become small. We also determine that ^nlogn steps are both 
necessary and sufficient for total variation distance to become small. These results provide 
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rates of convergence for random walks on a number of groups of interest: the hyperoctahedral 
group Z2 I Sn, the generahzed symmetric group I Sn, and 5*^ I Sn- In the special case 
of the hyperoctahedral group, the two rates of convergence are the same in both metrics. We 
also examine a slight variant of this random walk, establishing upper and lower bounds on 
its rate of convergence to uniformity. 

The comparison technique was introduced by Diaconis and Saloff-Coste (1993) as a method 
for bounding the rate of convergence to uniformity of a symmetric random walk on a finite 
group by comparing it to a benchmark random walk whose rate of convergence is known. 
Our random walks on the hyperoctahedral, generalized symmetric, and complete monomial 
groups provide such benchmarks to which many other random walks, modeling a wide range 
of phenomena, may be compared, thereby yielding bounds on the rates of convergence to 
uniformity for previously intractable random walks. Schoolfield (1998) used the results of this 
paper to analyze two specific examples using this technique, one of which has applications 
in mathematical biology. Schoolfield (1998) further specialized the comparison technique to 
random walks generated by random transpositions along the edges of a graph and analyzed 
several examples. 

Section ^ presents the basic properties and results from group theory and representation 
theory necessary to analyze random walks on groups. Section ^ extends the idea of random 
transpositions of n cards, analyzed in Section to a set of n decks of m cards each and 
beyond. 



2. Groups, Representations, and Random Walks. 



2.1. Introduction. Imagine n cards, labeled 1 through n, on a table in sequential order. 
Independently choose two integers p and q uniformly from {1, 2, ... , n}. If p q, transpose 
the cards in positions p and q; we denote this by r G Sn, where r is the transposition {p q). 
We shall refer to this procedure as a random transposition of n cards. \i p = q (which occurs 
with probability 1/?t-) , leave the cards in their current positions. This action is of course the 
identity permutation, which is denoted hy e E Sn- 

If this process is repeated many times, the cards will appear to be in uniformly random 
order, that is, will appear to be the result of a random permutation. This process may be 
modeled formally using a probability measure P (which we shall regard as a probability mass 
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function) on the symmetric group Sn, namely, 



P(e) := — where e is the identity element, 

n 

2 



P(t) := — where r is any transposition, (2- 1-1) 

P(7r) := otherwise. 

Repeating the process above k times is modeled as the convolution P*^ of the measure P 
with itself k times. Since there are n! elements in Sn, the uniform probability measure on 
the set of all permutations of Sn is given by 

U(7c) := ^ for every vr G Sn- (2.1.2) 
n! 

The following result, which is Theorem 1 of Diaconis and Shahshahani (1981) and was later 
included in Diaconis (1988) as Theorem 5 in Section D of Chapter 3, establishes an upper 
bound on both the total variation distance and the i"^ distance (both defined in Section |2^ ) 
between P*'^ and U. 

Theorem 2.1.3. Let P and U be the probability measures on the symmetric group Sn 
defined in ^2.1.]\ ) and ^2.1.^ ), respectively. Let k = log ra + cn. Then there exists a universal 



constant a > such that 

||P*'= - f/||Tv < linl'f^ \\P*'' - Uy < ae-"^^ for all c> 0. 

In the following sections we present the results that were needed to prove this theorem and 
which are used to prove analogous results in later sections. In Section ^]2| we present basic 



definitions from group theory. In Section 2.3 we present basic results from the theory of group 



representations, while Section ^]4| concentrates on the characters of these representations. In 



Section 2.5 we introduce the Fourier transform, and we show how it may be used to bound 



the distance to uniformity of a random walk on a group in Section |2.6| . In Section |2.7| we 
show how the results from these previous sections were applied to the random walk on the 
symmetric group defined above, proving Theorem |2.1.3| and a matching lower bound. 



2.2. Group Theory. We now present basic definitions from the theory of groups which 
will be needed to conduct our analysis. A more detailed introduction to this subject may be 
found in Chapter 1 of Alperin and Bell (1995). 

A group is a non-empty set G with a binary operation on G, usually called multiplication, 
which satisfies the following three axioms: (i) Multiplication is associative, (ii) There is a 
unique identity element e G G, and (iii) For every g E G there is a unique inverse element 
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e G such that gg~^ — e — g~^g. The number of elements of a finite group G is called 
the order of G and is denoted by IGI- 

A subset if of a group G is called a subgroup of G if it forms a group under the multiplica- 
tion of G restricted to H. A subgroup N oiG is called a normal subgroup of G if gNg'^ C N 
for all g E G, where gNg~^ :— {gng~^ E G : n E N}. If gh — hg for all g,h E G, then 
G is called an abelian group. Notice that every subgroup of an abelian group is a normal 
subgroup. 

The set kH :— {kh E G : h E H}, where k E G and if is a subgroup of G, is called 
a left coset of H in G. (Right cosets are defined analogously.) The set of all left cosets 
of if in G is called the left coset space and is denoted by G/H. If \G/H\ — n, a, set of left 
coset representatives {ki, k2, ■ ■ ■ , kn} may be chosen so that the left cosets kiH, k2H, . . . , knH 
comprise precisely the coset space G/H. li N is a, normal subgroup of G, then G/N is also 
a group known as the quotient group. 

One group of interest to us is the cyclic group Z„, which is the set {0,1,2, ... ,n — 1} 
under the operation of addition mod n. Notice that Z„ is abelian and that |Z„| — n. Another 
group of interest to us is the symmetric group Sn, which is the set of all permutations of 
{1,2, ... ,n} under the operation of composition of functions (where we compose functions 
from right to left). Notice that Sn is not abelian for any n > 3, and that \Sn\ — n\. 

Two elements g and h oi G are called conjugate if there exists some k E G such that 
h — kgk~^. The set of elements conjugate to a particular element g E G form the conjugacy 
class of g. Conjugacy is an equivalence relation and partitions a group G into disjoint subsets, 
namely, the conjugacy classes. 

The set of ordered pairs of elements of groups Gi and G2 with multiplication defined 
componentwise is called the direct product of Gi and G2 and is denoted by Gi x G2. The 
group of elements {gi, g2, . . . , gn','^) £ G" x Sn, with multiplication defined by 

{hi, . . . ,hn;cr) ■ {gi,...,gn,'K) = {hi • 5'o-i(i)' ■ ■ ■ ' ' fl'o-H")' '^^)' 

is called the wreath product of G with Sn and is denoted G I Sn- The identity clement 
of G I Sn is (e, . . . , e; e), and in G I Sn we have (5-1, ... , gn, vr)"^ = (5'^(\), . . . , g:^f^y, tt"^). 
This definition is equivalent to saying that G I Sn is the semidirect product of G^ with 5'n, 
where the action of Sn on G" is n{gi,g2, ...,gn) = {g^-Hi), gn-H2), ■ ■ ■ , fi'7r-i(n))- For a general 
definition of semidirect product sec Alperin and Bell (1995) or Simon (1996). Three special 
cases of particular interest to us are the hyperoctahedral group Z2 I Sn, the generalized 
symmetric group Zm I Sn, and Sm I Sn- 
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2.3. Representation Theory. We now present basic properties and results from the 
representation theory of groups which will be needed to conduct our analysis. A more detailed 
introduction to this subject may be found in Chapters 1 through 3 of Serre (1977) or in 
Chapter 2 of Diaconis (1988). Other sources include Alperin and Bell (1995) and Simon 
(1996). 

Suppose that y is a finite-dimensional vector space over the complex numbers. The general 
linear group GL(y) is the group of isomorphisms of V onto itself. An element of GL(y) is 
a linear mapping of V onto V . Such a map has an inverse which is also linear. 

A representation of a finite group G in y is a homomorphism p : G — > GL{V). The choice 
of a basis for V assigns an invertible matrix p{g) to each g E G. There exists p{g) e GL(y) 
for each g E G such that p{gig2) — p{gi)p{g2) for all gi,g2 e G. This implies that p{e) — I 
and that p{g~^) — p{g)~^- 

Two representations of G, say pi in Vi and p2 in V2, are said to be isomorphic (or equiv- 
alent) if there exists a linear isomorphism p, : Vi — > V2 such that p o pi{g) — P2{g) ° A* 
for all g & G. The dimension (or degree) of p is defined to be the dimension of V and is 
denoted by dp. Since we shall be interested in representations only up to equivalence, we may 
without loss of generality assume that V — C"' when dp — n. The trivial representation is 
the one-dimensional representation with V — £■ that sends every element of G to 1. 

A vector subspace W QV is stable under G if p{g)w e W for all w E W and all g E G. 
If 1^ C F is stable under G, then p restricted to GL(W^) gives a subrepresentation of p. A 
representation p is irreducible if the only vector subspaces of V which are stable under G are V 
itself and the trivial subspace. An important relationship between irreducible representations 
and conjugacy classes is given by the following, which is Theorem 7 in Section 2.5 of Serre 
(1977). 

Proposition 2.3.1. The number of nonisomorphic irreducible representations of G 
equals the number of conjugacy classes of G. 

An important relationship between the dimensions of the irreducible representations of 
G and the order of G is given by the following, which is Corollary 2(a) of Proposition 5 in 
Section 2.4 of Serre (1977). 

Lemma 2.3.2. Suppose that {pi, p2, . . . , Ps} is a complete set of nonisomorphic irreducible 
representations of G with dimensions dp-^^jdp^, . . . , dp^, respectively. Then 
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i=l 

It follows directly from Proposition p.3.1| and Lemma |2.3.2| that a group G is abelian if 



and only if all of its irreducible representations are one-dimensional. 

The following useful result is Schur's Lemma, which is Proposition 4 in Section 2.2 of Serre 
(1977). 

Lemma 2.3.3. Suppose that pi : G — > GL(Fi) and p2 : G — > GL(V2) are irreducible 
representations of G and that p : Vi — > V2 is such that p o pi{g) = p2{g) o p for all g E G. 
Then (i) if pi and p2 are not isomorphic, it follows that p = 0; and (ii) ifVi = V2 and pi = p2, 
it follows that p is a constant times the identity. 

The direct sum A(BB of an m x m matrix A and a.nn x n matrix B is the (m+n) x (m+n) 
'A 



block diagonal matrix g ^ . The direct sum pi © p2 of two representations is then defined 

by (pi © P2) (g) = Pi{g) © P2{g)- By use of the direct sum, the irreducible representations 
of G can be used to construct all other representations of G, as described in the following, 
which is Theorem 2 in Section 1.4 of Serre (1977). 

Proposition 2.3.4. Every representation of G is the direct sum of irreducible represen- 
tations ofG. 

The tensor product A® B oi an m x m matrix A and an n x matrix B is an mn x mn 
matrix which is constructed in the following manner. Begin with an m x m block matrix 
in which each of the m? blocks is the matrix B. Then multiply the block in position (i, j) 
by the scalar aij G A for 1 < i, j < m. The tensor product pi © p2 of two representations is 
then defined by (pi © P2) (g) = Pi{g) © P2(fi')- By use of the tensor product, the irreducible 
representations of Gi and G2 can be used to construct all the irreducible representations of 
their direct product, as described in the following, which is Theorem 10 in Section 3.2 of 
Serre (1977). 



Proposition 2.3.5. Suppose that pi and p2 are irreducible representations of Gi and 
G2, respectively. Then pi © p2 is an irreducible representation of Gi x G2. Furthermore, 
each irreducible representation ofGi x G2 is isomorphic to such a representation. 
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Suppose that if is a subgroup of G and that p is a representation of G. A representation 
p J,^ of if, known as the restricted representation, can be constructed from p by defining 



Now suppose that if is a subgroup of G and that p is a representation of H. Also suppose 
that {/ci, k2, ■ ■ ■ , kn} is a complete set of left coset representatives of if in G. A representation 
p of G, known as the induced representation, can be constructed from p by defining, for 
each g & G, an n x n block matrix p |^ (g) whose block in position for 1 < i, j < n, 

is the dp X dp matrix 



The induced representation does not depend on the choice of coset representatives. 

2.4. Character Theory. The character of the representation p at the element g E G 
is defined to be the trace of p{g) and is denoted by Xp(g'). Notice that the character of a 
representation is independent of the choice of basis of V. The characters of the irreducible 
representations of G are called irreducible characters. The choice of the term "character" is 
to emphasize that it characterizes the representation; according to Corollary 2 of Theorem 
4 in Section 2.3 of Serre (1977), two representations are isomorphic if and only if they have 
the same character. 

Some important properties of characters are given in the following, which is Proposition 
1 in Section 2.1 of Serre (1977). 

Lemma 2.4.1. Suppose that x is the character of a representation p ofG with dimension 
dp. Then 



The characters of direct sums and tensor products may be easily calculated by use of the 
following, which is Proposition 2 in Section 2.1 of Serre (1977). 

Lemma 2.4.2. Suppose that pi and p2 are representations of G with characters Xi (ind 
X2, respectively. Then the character of the direct sum pi Q) p2 is Xi + X2 and the character of 
the tensor product pi ® P2 is Xi ' X2- 



p{h) for each h E H. 




(a) x(e) = dp for e e G, 



(b) xig-') = xig) for geG, 

(c) x{hgh-^) = xig) for g^heG. 
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The characters of induced representations may be calculated by use of the following, which 
is Corollary 6 in Section 16 of Alperin and Bell (1995). 

Lemma 2.4.3. Suppose that xh is the character of an irreducible representation Ph of a 
subgroup H of a finite group G. For g G G, suppose that the number t of conjugacy classes of 
H whose members are conjugate in G to g is positive. Let hi, h2, ■ ■ ■ ,ht be representatives of 
these t conjugacy classes of H and let ki, k2, ■ ■ ■ ,kt be the sizes of these classes. Let i be the 
size of the conjugacy class of g in G. Then the value at g of the character x of the induced 
representation pu Ti/ of G is given by 

\G\ L 



i=l 



Suppose that /i and /2 are functions on a group G. The inner product of /i and /2 is 
defined by 

(/i,/2)g := -^^^h{9)f2{g)- 
A function on G which is constant on each conjugacy class of G is called a class function. 



Notice that Lemma |2.4.1| (c) asserts that the character x of & representation p of G is a class 
function. An important relationship between the irreducible characters of G and the space 
of class functions on G is given by the following, which is Theorem 6 in Section 2.5 of Serre 
(1977). 

Proposition 2.4.4. The characters xi-, X2, ■ ■ ■ ,Xs of the irreducible representations of 
G form an orthonormal basis for the Hilbert space of class functions on G with respect to the 
inner product (■, ■) defined above. 

2.5. The Fourier Transform. We now introduce an extremely important tool for our 
analysis. Suppose that P is a function on G. The Fourier transform P of P at the represen- 
tation p is defined to be the matrix 

P(p) := Y.P(9)P{9). 

Suppose that P and Q are functions on G. The convolution P * Q of P and Q is defined, 
for all g E G,hj 



P*Q{9) ■■= Y.P^3h~')Q{h). 



heG 
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The Fourier transform converts the convolution of functions P and Q into the (argumen- 
twise) multiphcation of their transforms P and Q, i.e., 

pTQip) = P{p)Q{p). 



In the special case when P is a class function, it is a consequence of Schur's Lemma (|2.3.3| ) 
that the Fourier transform may be calculated easily by use of the following, which is Lemma 
5 of Diaconis and Shahshahani (1981). 

Lemma 2.5.1. Suppose that p is an irreducible representation of a finite group G with 
character x (^nd that P is a class function. For each conjugacy class i, let Pi be the constant 
value of P on the class, let Ui be the cardinality of the class, and let Xi be the constant value 
of X on the class. Then the Fourier transform of P is given by 



Pip) 



1 

- 2^ PiHiXi 



dp . ^ 
^ 1=1 



I, 



where dp is the dimension of p, I is the dp- dimensional identity matrix, and the sum is taken 
over distinct conjugacy classes. 

The Fourier transforms of any distribution at the trivial representation and of the uniform 
distribution at any nontrivial representation have special forms, as described in the following, 
which is an immediate consequence of the preceding lemma and Proposition |2.4.4. 



Corollary 2.5.2. Suppose that P is any probability measure and that U is the uniform 
probability measure, both defined on a finite group G. Then P{po) = 1 at the trivial repre- 
sentation Pq of G and U{p) is the dp x dp zero matrix for each nontrivial representation p of 
G. 

The function P on G may be reconstructed from its Fourier transform by use of the 
following, which is the Fourier inversion formula in Section 6.2 of Serre (1977). 

Lemma 2.5.3. Suppose that P is a function on G and that P is its Fourier transform. 
Then for each g E G 

m = p$^t/p,tr(A(r')P(p.)), 

' ' i=l 

where pi, P2, ■ ■ ■ , Ps are the nonisomorphic irreducible representations of G with dimensions 
dp^, dp2, . . . , dp^, respectively. 
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A consequence of this formula is another useful result, which is the Plancherel formula in 
Section 6.2 of Serre (1977). 

Lemma 2.5.4. Suppose that P and Q are functions on G and that P and Q are their 
Fourier transforms. Then 

geG ' ' i=l 

where pi, p2, ■ ■ ■ , Ps are the nonisomorphic irreducible representations of G with dimensions 
dp^,dp2, . . . , dp^, respectively. 

2.6. Random Walks on Groups. We now turn our attention to the subject of random 
walks on groups. A more detailed introduction to this subject may be found in Chapter 3 of 
Diaconis (1988). 

Suppose that P is a probability measure defined on a group G. Let ^i, ^2, ■ ■ ■ be a sequence 
of independent G- valued random variables each distributed according to P. A random walk 
on G is a sequence X = [Xq, Xi, X2, . . .) defined hy Xq :— e & G and X„ := ^nCn-i " " " 
all n > 1. Notice that X is a Markov chain with state space G: 

'^{Xfi — gn\ — e, Xi—gi, . . . , Xn-l — Qn-l} 

— P{CnCn-l ' " ^1 — 9n \ ^1 — 9l, ■ ■ ■ , ' ' ' ^1 — 9n-l} 

= mn = gngn-l} = P{9n9n-l) 

for all n > 1 and all (71, . . . , 5f„ G G. In this way a probability measure P on G induces a 
transition matrix P, where the entry of P at the intersection of the row corresponding to 
g & G and the column corresponding to /i e G is given by 

P,, = P{hg-^). 

In the special case where P is a class function, we may determine all the eigenvalues of 
the transition matrix P, together with their multiplicities, by use of the following, which is 
Corollary 3 of Diaconis and Shahshahani (1981). 

Lemma 2.6.1. Suppose P is a probability measure defined on a finite group G and that 
P is a class function. Let P be the transition matrix of the Markov chain induced by the 
probability measure P. Then, for each irreducible representation p ofG, there is an eigenvalue 
Tip of P occurring with algebraic multiplicity such that 
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i=l 

where the sum is taken over distinct conjugacy classes. 



Notice that vTp ■ / in the lemma above is exactly the value of P{p) in Lemma |2.5.1 . 

In order to discuss the convergence of these random walks to their stationary distributions, 
we need a metric between probability measures. Suppose that P and Q are two probability 
measures defined on a finite group G. The total variation distance between P and Q is defined 
by 

||P-g||Tv := m^x\P{A) - Q{A)\, 
while the distance between P and Q is defined by 

\\P-Qh := J2\P{9)-Q{9)\. 

g&G 

Notice that ||-P — Q||tv = The £^ distance between P and Q is defined by 

1/2 



\P-Qh ■■= (j2\P(9)-Q{9)A . 

VoeG / 



KgeG 

All three of these measures of distance are indeed metrics. It is a direct consequence of the 
Cauchy-Schwarz inequality that 



ip-qWty < m\\p-Q\ 



We are now able to bound the distance to uniformity of a probability measure P in terms 
of its Fourier transform by use of the following, which is the Upper Bound Lemma in Section 
B of Chapter 3 of Diaconis (1988). 

Lemma 2.6.2. Suppose that P is a probability measure defined on a finite group G. Then 

||P-f/||^v < i|G|-||P-f/||^ = i5^rfptr(p(p)P(pr) 

p 

where the sum is taken over all nontrivial irreducible representations of G and P{p)* is the 
conjugate transpose of P{p). 



In the Upper Bound Lemma ( 2.6.2 ), the inequality follows from the Cauchy-Schwarz inequal- 
ity; the equality follows from the Plancherel formula (|2.5.4 ) and Corollary |2.5.2 . 
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2.7. Random Walk on the Symmetric Group. We now return to the random walk 
on the symmetric group defined in Section |2.1| and the proof of Theorem |2.1.3| . We include 



only those portions of the proof that will be needed to conduct our analysis in later sections. 
A detailed introduction to the symmetric group and its representation theory may be found 
in Chapters 1 and 2 of James and Kerber (1981). Another source is Sagan (1991). 

For any tt G Sn, there is an associated ra-dimensional vector a = (ai, 02, . . . , a„), called 
the cycle type of tt, where Oj is the numbers of cycle factors of vr of length i for 1 < i < n. 
According to Lemma 1.2.6 of James and Kerber (1981), two elements of Sn are conjugate 
if and only if their cycle types are identical; hence, there is a one-to-one correspondence 
between these cycle type vectors and conjugacy classes of Sn- Thus, since the transpositions 
in Sn form their own conjugacy class, the probability measure P defined in ( p.l.l| ) is a class 
function. 

A vector [A] = [Ai, A2, . . . , A^] such that Ai > A2 > • ■ ■ > A^ > and Ai + A2 + ■ ■ ■ + A^ = 
is called a partition of n. According to Theorem 2.1.11 of James and Kerber (1981), there 
is a one-to-one correspondence between nonisomorphic irreducible representations of 5*^ and 
partitions [A] of n. Thus we have identified all of the irreducible representations of Sn, over 
which the summation is taken in the Upper Bound Lemma ( p.6.2|) . 



Lemmas p.5.1| and p.4.1| (a) are then used to calculate the Fourier transform of an irreducible 



representation p[A] of Sn, which is 

P(A) 



1 n - 1 
- + r A 

n n 



1 n - 1 ,^ , 
- + r A 

n n 



where r(A) := X[A](T)/rf[A], X[a](t) is the character of p[\] at any transposition r, d[\^ is the 
dimension of p[A], and I is the (i[A]-dimensional identity matrix. 
It then follows from the Upper Bound Lemma ( p.6.2| ) that 

r 1 „ 1 n 2A: 

\\P*'-U\\l^ < \n\\\P*'-U\\l = 15^4] 

[A] 

where the term not to be included in the summation occurs when [A] = [n] , which corresponds 
to the trivial representation of Sn- 

The following formulas, found is Section D of Chapter 3 and Section B of Chapter 7, 
respectively, of Diaconis (1988) are used to calculate the numerical value of the Fourier 
transform. 

Lemma 2.7.1. Suppose that p is an irreducible representation of Sn corresponding to the 
partition [A] = [Ai, . . . , A^] of n. Let r(A) := X[A](T)/rf[A] with r G Sn- Then 
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r(A) 



k 

- (2J-1)A,] and 



nin — 1) ^ 



(i[A] = n\ det 



l<i,j<k 



with 1 /ml := if m < . 



A lengthy, detailed discussion in Section D of Chapter 3 of Diaconis (1988) determines the 
existence of a universal constant a > such that ii k = ^nlogn + cn with c > 0, then 

2k 



IE 4, 

[A] 



1 n — 
- + 

n n 



(2.7.2) 



This completes the proof of Theorem 2.1.3. 



Theorem |2.1.3| shows that k = |nlogn + cn steps are sufficient for the (normalized) £^ 
distance, and hence the total variation distance, to become small. That k = ^nlogn — cn 
steps are also necessary is a result of the following, which is a solution to Exercise 13 in 
Section D of Chapter 3 of Diaconis (1988). The method employed in the proof is a standard 
technique used in problems of this sort. 

Theorem 2.7.3. Let P and U be the probability measures on the symmetric group Sn 
defined in ^2.1.:^) and ^2.1.2^ ), respectively. Let k = |nlogn — cn be a nonnegative integer, 
with c > 0. Then there exists a universal constant a > such that 



ae 



-2c 



Uh > \\P*'-U\\ty > 1 
Proof. Let x be the character of the representation P[n-i,i] of Sn', this representation 



corresponds to the largest term in the summation from the proof of Theorem |2.1.3| . Since 



any element of Sn is conjugate to its inverse, it follows from parts (b) and (c) of Lemma |2.4.1 
that X is real. 



Under the uniform measure U, it follows from Proposition |2.4.4| that 

Euix) = ^ E = {x,Xo)sn = 0, 

nl ^-^ 



where Xo is the character of the trivial representation p[„], and that 



Vaiuix) = Euix' 



nl ^-^ 



TT 



{x,X)t 
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It follows from Lemma |2.7.1| that d[n-i,i] = n — 1 and r([n — 1, 1]) = Thus under the 
fc-fold convolution measure P*^, it follows from Lemma |2.5.1| and (the calculations in the 
proof in Section |2.7| of) Theorem p.l.3| that 



Ep..{x) = 5^ P*^7r)x(7r) = tr P*'{7r)p{n) = tr P^(p) 



[n 



1) 



where p = P[n-i,i]- 

In order to determine Varp.fc(x), we must now calculate Ep,k{)(^). It follows from 
Lemma |2.4.2| that is the character of the representation p[„_i i] ® P[n-i,i]- Recall from 
Proposition |2.3.4] that every representation is the direct sum of irreducible representations; 



in this case, it follows from the example following Lemma 2.9.16 of James and Kerber (1981) 
that we have the isomorphism 

P[n-l,l] ® P[n-l,l] — P[n] © P[n-l,l] © P[n^2,2] © P[n-2,1,1]- 



So it follows also from Lemma |2.4.2| that 

Ep*k{x^) = Ep*k{X[n]) + Ep,k{X[n-l,l]) + Ep,k{X[n-2,2]) + Ep*k{X[n-2,l,l]) ■ 

As above, it follows from Lemma ^.7.1| that (i[„_2,2] = \n{n — 3) and r([n — 2, 2]) = and 
that d[n-2,i,i] = |(^ ~ ~ 2) and r{[n — 2, 1, 1]) = Thus under the /c-fold convolution 



measure P* , it follows from Lemma |2.5.1| and (the calculations in the proof in Section 



of) Theorem |2.1.3| that 



Ep*k{X[n]) = 1, 

Ep*k{x[n-2,2]) = kn{n - 3) (l - |)^'' , and 



Ep,k{xin-2,i,i]) = l{n-l){n-2){l-^y. 
These results combine to show that 

Varp..(x) = l + {n-l){l-l)' + l(n - l)(n - 2) (l - 

- l(n^-n + 2)(l-ff . 

Let k = in log n — cn. By elementary calculus, x < — log(l — x) < for < x < 1. 
Thus, if n > 3 and c > 0, 

Ep,k{x) = {n-l){l-lf > (n - l)e-^'=/("-^) 
where we note that, for n > 3, (l — ^) (^)^''^" "^^ is increasing and e^'^/^""^-' > 1. 
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In order to bound the variance, notice that 

\{n - l){n - 2) (1 - i)' < \{n - l){n - 2)e-^^l^ = 1 (l - | + ^) e^^ and 

1 (^2 _ ^ + 2) (1 - > -n + 2)e-4^'/("-^) 



1 (1_ 1 + ^) m4/("-2) 4c 8c/(n-2)^ 
2\ n J \nJ 



Thus 



when 



|(n-l)(n-2)(l-i)« - lin^-n + 2){l-lY' < 



i(l_3+^U4c _ Wl_ 1 +4Wl)4/(n-2)^4Cg8c/(n-2) < 
2\ n ri'' / 2 \ n n'' / \n / — 

This restriction is equivalent to 

C < ilogn - (^) log (1 - f + + (^)log(l-i + ;^). 

Thus, when c G (0, | logra]. 



Varp,.(x) < 1 + < 1 + (n - l)e-2'=/" = 1+ (l - ^) e^^ < 1 + 



2k/n ^ T ^ h _ iW2c ^ 1 ^ 



Now define Aa '■= {vr G S'„ : ^ ct}- K follows from Chebyshev's inequality that 

X 1 _|_ g2c 

f/(^a) > 1 ^ and that P*\Aa) < provided < a < ^e^'^. Then 



1 1 + e 



2c 



l(n!)^/^||P--f/|h > ||P*'^-f/||TV > 1 (Ae2c_«)2- 
Choosing a = ^e^'^ shows that 

i(n!)^/^||P*'=-f/||2 > \\P*''~U\\ty > 1 - 729e-^' - 14586"^" > 1 - 21876"^^ 
which completes the proof. □ 



The upper bound in Theorem p.l.3| , taken together with the lower bound in Theorem p. 7.3 



gives an example of the so-called "cutoff phenomenon." The total variation distance after 
k steps is nearly 1 until k reaches about |nlogn and then drops precipitously toward 0, 
the dropoff occurring on the relatively small scale of n. For further discussion of the cutoff 
phenomenon see Diaconis (1988). 
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3. Random Walk on the Complete Monomial Groups. 



3.1. Introduction. We now extend the idea of random transpositions of n cards, intro- 



duced in Section |2.1| , to a set of n decks of m cards each and beyond. Imagine n decks of 
cards, labeled 1 through n, in sequential order, each with its m cards in sequential order. 
Independently choose two integers p and q, uniformly from {1, 2, ... , n}. 

If p 7^ g, transpose the decks in positions p and q. Then, independently of the choice of 
p and q and uniformly (i.e., with probability = each), permute the deck terminating 
in position p by a permutation in G = Sm] and independently, also uniformly, permute 
the deck terminating in position q. This procedure is denoted by [v; r), where t E Sn is the 
transposition {p q) and the only possible non-identity entries of {7 G G" = S"^ are in positions 
p and q. The element of v in position p (resp., g) is vr G G = Sm if the deck terminating in 
position p (resp., q) is permuted by tt G G = Sm- 

If p = q (which occurs with probability 1/n), leave the decks in their current positions. 
Then, again independently and uniformly, permute the deck in position p = g by a permu- 
tation in G = Sm- If the order of the deck is changed, this action is denoted by {u; e), where 
e E Sn is the identity permutation and the only non-identity entry of m G G" = 5*^ is in 
position p = q. If the order of the deck is not changed, then this action is of course the 
identity, which is denoted by (e; e). 

If this process is repeated many times, the decks will appear to be in random order, and 
each deck will appear to be randomly permuted. 

The specific example above was for motivational purposes only. In this section we will 
actually examine a random walk on G I Sn for any group G, not just the symmetric group 
Sm- In this more general setting, the example above is equivalent to beginning with a vector 
(e, . . . , e) G G", where each e G G is the identity element. Two elements of this vector are 
then transposed as the decks of cards were above. The transposed elements of this vector are 
then multiplied by elements of G as the individual decks were permuted above. 

We refer to the process on G I Sn described above as the independent shuffles random 
walk, retaining use of the word "shuffles" even when G is not necessarily Sm- A similar 



process, known as the paired shuffles random walk, will be introduced in Section |377 . 

In the special case of the generalized symmetric group I Sn, imagine n wheels, each of 
which may stop at one of m values. The process described above randomly transposes two 
of the n wheels and then independently spins the transposed wheels. We thus refer to the 
process in this case as the independent spins random walk. 
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In the special case of the hyperoctahedral group Z2 I 5'„, imagine n cards, each with an 
orientation (up or down). The process described above randomly transposes two of the n 
cards and then independently flips the transposed cards. We thus refer to the process in this 
case as the independent flips random walk. Schoolfield (1998) used the comparison technique 
to analyze two random walks by comparing them to the independent flips random walk. 

The independent shuffles random walk may be modeled formally by a probability measure 
P on the complete monomial group Gl Sn- Since there are (2) transpositions of n elements, 
then there are \G\^ ■ (2) = ^\G\'^n{n — 1) elements of the form {v; r). There are also r2(|G| — 1) 
elements of the form {u; e) with u ^ e. We may thus define the following probability measure 
on the set of all elements of G I Sn- 



P{e-e) 



1 



\G\n 



P(u; e) = where u ^ e & G^' 

2 

P(v; t) = „ where v e C, 
IGrW 



(3.1.1) 



P{x]it) =0 otherwise, 

where there is only one non-identity entry of m G G", and where if t & Sn is the transposition 
{p q) then the only possible non-identity entries of {7 G G" are in positions p and q. In the 
special case of the hyperoctahedral group Z2 I Sn, we refer to the elements (m; e) as signed 
identities and the elements {v; r) as signed transpositions. 

Since there are IGI" ■ n! elements in G I Sn, the uniform probability measure is given by 

Uix; n) = for (f ; n) E G I Sn. (3.1.2) 

The following result establishes an upper bound on both the total variation distance and 
the £2 dist ance between P*^ and U. We establish an analogous result for the paired shuffles 



random walk as Theorem 3.7.3. 



Theorem 3.1.3. Let P and U be the probability measures on the complete monomial 
group G I Sn definedin ^3.1.1\) and ( \3.1.3[ ), respectively. Letk = |nlogn+^nlog(|G| — l)+cn. 



Then there exists a universal constant b > such that 

\\P*''-U\\ty < i(|G|"n!)^/^||P*'=-f/||2 < be'^" for all c> 0. 

In the following sections we establish the results necessary to prove this theorem and an 
analogous theorem for the paired shuffles random walk. In Section p]2| we study the basic 
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properties of the complete monomial groups. In Section p.3| we show that the probability 



measure defined above is constant on conjugacy classes. In Section 3.4 we identify all of the 



irreducible representations of the complete monomial groups and in Section R]5| we calculate 



the characters of these irreducible representations. In Section |3lB we calculate the Fourier 



transform of the probability measure defined in ( |3.1.1| ) and prove Theorem |3.1.3| along with 



a matching £^ lower bound. In Section |3.7| we perform a similar analysis of the paired shuffles 
random walk; however, the results are quite different in the case that G is nonabelian. 

3.2. The Complete Monomial Groups. The complete monomial group G I Sn is 
the wreath product of the group G with the symmetric group Sn- Special cases include the 
hyperoctahedral group Z2 I Sn, the generalized symmetric group I Sn, and Sm I Sn- The 
elements of G I Sn may be represented as (x; tt) G G" x Sn- It then follows that the order 
of the complete monomial group is |G I Sn\ — 1^1" ' 

Each element (x; tt) ^ G I Sn acts on a vector w G by first permuting its elements 
according to the permutation tt and then left-multiplying the elements of the permuted vector 
by the elements of x, entry by entry; i.e., for any (x; n) E G I Sn and any w G G", 

{x;n){w) = (xi ■ ty^-i(i), . . . ,x„ ■ iu^-i(„)) G G". 

In keeping with the decks of cards analogy, each element (x; tt) G Sm I Sn first permutes 
the n decks of cards according to vr and then permutes each deck of cards according to the 
entry of x at its new index. 

Considering the actions, described above, of the elements of the complete monomial group 
on vectors in G", it follows that the product of two elements (x; vr), {y; a) E G I Sn is given 

by 

{y;a) ■ (x; vr) = {yi ■ x^-i(i), ...,?/„■ x^-i(„); avr) . 

Thus the identity element is (e, . . . , e; e) where (e, . . . , e) G G" and e is the identity permu- 
tation in Sn- It also follows that the inverse of any element (x; vr) G G I S'„ is 

(x; tt) ^ = - ■ ■ , ^T^lny ^ 

There are two very important subgroups of G I Sn which should be considered. The 
subgroup consisting of elements of the form {(x; e) : x G G"} is isomorphic to G" and is a 
normal subgroup of G I Sn- The subgroup consisting of elements of the form {(e, . . . , e; vr) : 
7r G S'„} is isomorphic to Sn- Notice that any element oi G I Sn can be written uniquely as 
the product of a single element from each of these two subgroups. 
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3.3. Conjugacy Classes. Recall that any permutation vr G S'„ can be written as the 
product of at most n disjoint cyclic factors. (For any index not moved by vr, there is considered 
to be a cyclic factor of length one.) For any vr G S'„ there is an associated n-dimensional 
vector, called the cycle type of vr, which lists the number of disjoint cyclic factors of vr of each 
possible length. 

For any (x; n) E G I Sn, the permutation n can be written as the product of disjoint 
cyclic factors, as described above. Suppose that vr is the product of j disjoint cyclic factors 
of lengths {ki, . . . , kj}, respectively, with ki + ■ ■ ■ + kj = n. Then tt can be written as 

..-a ..-a ... (^)^) ...^^) 

where each im G {1, 2, . . . , n}. For each disjoint cyclic factor of vr, we may define 

ge{x]n) = x m ■ Xm Xm, 

which is called the £th cycle product of (x; vr). Notice that g(,{x\ tt) E G for all 1 < £ < j. 

Suppose that G has s conjugacy classes Ci, C2, . . . , C^, with Ci being the conjugacy class 
of the identity element e G G. By calculating ge{x; vr) for all 1 < £ < j, we may construct an 
s X n type matrix a{x; tt) for (x; tt) in the following manner. Let the kth entry of the jth row 
be the number of cyclic factors of length k contained in vr for which ge{x] vr) G Gj. According 
to Theorem 4.2.8 of James and Kerber (1981), two elements of G I Sn are conjugate if and 
only if their type matrices are identical. Notice that the vector of column sums of a(x; vr) is 
the cycle type vector of vr. 

We are primarily interested in the type matrices of the elements in the support of the 
probability measure defined in ( p.l.l| ). In the identity element (e; e) E G I Sn, the identity 
permutation e G S'„ is the product of n disjoint cyclic factors each of length one, and each of 
these cyclic factors corresponds to an identity element in e = (e, . . . , e) G G"". Thus the type 
matrix of the identity element is the s x n matrix with aii(e;e) = n and the remaining 
entries all zeros. 

In the special case of the hyperoctahedral group Z2 I Sn, we have s = 2. Thus the type 
matrix of the identity element is 

. fnOO---0\ 
«(0;e)=^QQQ...Qj. 

Notice that in each (m; e) E G I Sn with u ^ e, the identity permutation e E Sn is again 
the product of n disjoint cyclic factors each of length one. However, u E G^ has a single non- 
identity entry, which corresponds to one of the cyclic factors of the identity permutation. 
Thus, there are s — 1 different type matrices for these elements, each with aii(u; e) = n — 1, 
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with ttkiiu', e) = 1 if the non-identity entry of u is in Ck and the remaining entries all zeros. 
Hence, the elements of the form (u; e) E G I Sn split into s — 1 conjugacy classes. 

In the special case of the hyperoctahedral group Z2 I Sn, the type matrix of each of the 
signed identities, which together form a single conjugacy class, is 

n- 100---0 
1 00---0 

In each [v; t) E G I Sn, the transposition r G S'„ is the product of n — 2 cyclic factors of 
length one and one cyclic factor of length two. Also, each of the n — 2 cyclic factors of length 
one corresponds to an identity e G G in G G", while the only possible non-identity entries 
in -u G G"' correspond to the one cyclic factor of length two. Thus ge{v] r) = for each of the 
n — 2 cyclic factors of length one, but giiv; r) can be an arbitrary element of G for the one 
cyclic factor of length two. Thus, there are s different type matrices for these elements, each 
with an{v; r) = n — 2, with ak2{v] r) = 1 if ge{v] r) G Gk for the one cychc factor of length 
two, and with the remaining entries all zeros. Hence, the elements of the form {v; r) E G I Sn 
split into s conjugacy classes. 

In the special case of the hyperoctahedral group Z2 I Sn, the calculation of the type 
matrix splits the signed transpositions into two sets. Let us refer to a signed transposition 
which flips neither or both of the cards as an even transposition and designate it as {v; r+). 
We will refer to one which flips either, but not both, of the cards as an odd transposition 
and designate it as {v; r~). Thus the type matrices of the even transpositions and the odd 
transpositions are, respectively, 

n-210---0\ ,^ _^ _ /n- 200---0 

00---0j ' "^^'^ ~ \^ 10---0 

We have now identified the type matrices (and, hence, the conjugacy classes) of all the 
elements in the support of the probability measure defined in (|3.1.1| ). Furthermore, we have 
established the following. 

Lemma 3.3.1. The probability measure defined in ( | J. i . j[ j is constant on conjugacy 
classes. 

Knowing that there are the same number of conjugacy classes as there are type matrices 
allows us to calculate the number of conjugacy classes. According to Lemma 4.2.9 of James 
and Kerber (1981), the number of conjugacy classes of G I Sn is equal to 

Vini) ■ p{n2) ■ ■ ■ p{ns) 

(ni,n2,.--,"s) 
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where p{k) is the number of partitions of k (with p{0) := 1) and the sum is taken over all 
s-dimensional vectors such that rii + n2 + ■ ■ ■ + rig = n and rij > for all 1 < j < s. 

We may also calculate the order of each of the conjugacy classes. According to Lemma 
4.2.10 of James and Kerber (1981), the number of elements of a particular type a(x; vr) in 
G I Sn, and hence the number of elements of a particular conjugacy class, is equal to 

IGI" -n! 



n,,[(j|G|/|a|)-.-a.,!] 

where aij is the (i, j)th element of a{x; it) and \Ci\ is the order of the ith conjugacy class of 
G. By applying this result to the elements of G I Sn whose type matrices were determined 
above, we have the following otherwise obvious corollary. 

Corollary 3.3.2. There are n\Ck\ elements of the form {u; e), for 2 <k<s, and there 
are \n{n — 1) ■ |G| ■ |Cfc| elements of the form {v; r), for 1 < k < s. 

3.4. Irreducible Representations. We now construct a collection of irreducible rep- 
resentations of the complete monomial groups from irreducible representations of certain of 
their subgroups. We will later see that every irreducible representation of G I Sn can be 
constructed in such a manner. The method used is from Section 4.3 of James and Kerber 
(1981). Other methods described in Section 8.2 of Serre (1977) and in Chapter V of Simon 
(1996) could be used in the special case when G is abelian. 

Let G* be the subgroup oiG I Sn consisting of all elements of the form {(x; e) : x G G"} and 
let H* be the subgroup of G I Sn consisting of all elements of the form {(e, . . . , e; tt) : tt G Sn}- 



Recall from Section |3]^ that G* is a normal subgroup of G I Sn and that G* = and 
H* = Sn- Recall, furthermore, that each element of G I Sn can be written uniquely as a 
product gh with g E G* and h G H*. 



Suppose that G has s conjugacy classes. Then it follows from Proposition |2.3.1] that there 



are s irreducible representations of G, namely, pi, p2, - - - , Ps, where we choose our labelling so 



that pi is the trivial representation. So it follows from Proposition p.3.5| that any irreducible 
representation of G" is isomorphic to an n-fold tensor product 

where each p^^^ G {pi, p2, . . Ps}- Notice that any irreducible representation of G" can be 
represented by a vector x G {1, . . . , s}", where the entries of x correspond to the indices of 
the factors in the tensor product above. 
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Suppose that the tensor product forming a particular irreducible representation of 
contains nj factors of the kind pj, for all 1 < j < s. The vector (n) = (ni,n2, . . . ,ns) is 
called the type of this representation. Notice that for each type (n) = {ni,n2, ■ ■ ■ ,ns), there 
is a unique representative whose first rii factors in the tensor product are pi, whose next n2 
factors are p2, etc. We will refer to this canonical representative as 

Since G" = G* = {{x;e) : x G G"}, any representation p of G" extends easily to a 
representation of G* by setting p(x; e) = p{x). Thus any representation of G" is also a 
representation of G*. Therefore, the collection {p(n)} is a complete system of irreducible 
representations of G* which are of different types. 

We now turn our attention to H* = {(e, . . . , e; vr) : vr G S'„} = Sn- For each type (n) = 
(^1, 77-2, rig), define S(^n) to be the subgroup of Sn which permutes the first rii indices 
among themselves, the next n2 indices among themselves, etc.; but does not commingle 
these s sets of indices. Thus S(^n) — x x ■ ■ ■ x Sn^, where we define Sn^ := S'l if rij = 
for any 1 < j < s. 

Recall that there is a one-to-one correspondence between irreducible representations of 
Sn and partitions [A] of n, where [A] = [Ai, A2, . . . , A^] with Ai > A2 > ■ ■ ■ > A^ > and 
Ai + A2 + ■ ■ ■ + Afc = n. Thus any irreducible representation of Sn niay be denoted as p[x], 
where [A] is the corresponding partition of n. 



It follows from Proposition |2.3.5| that any irreducible representation of S^ x S'„2 x ■ ■ ■ x S'„ 
is isomorphic to the s-fold tensor product 



P[Ai] ® P[X2] ® ■ ■ • ® P[\s] 

where (A) = ([Ai], [A2], • • • , [A^]) are partitions of {ni,n2, • • • ,n4 and {P[Ai], P[A2], • • • ,P[a»]} 
are irreducible representations of {Sm, Sn2, • • • , Sn^}, respectively. We will denote this tensor 
product as p(A). Thus, there is a one-to-one correspondence between irreducible representa- 
tions of Sn-^ X Sn2 X ■ ■ ■ X Sn^ and ordered s-tuples of partitions of (n) = {rii, n2, ■ ■ ■ ,ns). 
Since S^ x Sn2 x ■ ■ ■ x Sn^ — , then any representation of S^ x Sn2 x ■ ■ ■ x Sn^ is also a 
representation of S'(„). 

Let us define S^^-^ = {(e, . . . , e; vr) : vr G Since S(^n) is a subgroup of Sn, it follows 

that S^^-j is a subgroup of G" I Sn- Since S^^-^ = S(n), any representation p of S'(„) extends 
easily to a representation of 5**^^ by setting p(e, . . . , e; tt) = p(7r). Thus any representation 
of S(n) is also a representation of S^*^-). Therefore, for fixed type (n) = {ni,n2, . . . ,ns), the 
collection {p(a)} is a complete system of irreducible representations of S^^^, when (A) ranges 
over all ordered s-tuples of partitions of (rii, n2, . . . , n^), respectively. 



RANDOM WALKS ON WREATH PRODUCTS OF GROUPS 



23 



Now consider the wreath product G I S'(„), which is a subgroup of G I Sn- The representa- 
tions p(„) : G* GL(Fi) and : S^^ GL(l^2), where (A) = ([Ai], [A2], . . . , [A,]) are 
partitions of (n) = (ni,n2, . . . respectively, may be combined to form representations 
P{n) ® P{\) of G I S(^n) by defining (with an abuse of the notation "(g)," since this is not quite 
the same notion of tensor product as in Section |2.3|) 



{p{n)® P{\)){x]'k){vi®V2) := P(„)(5;e)(i;i) (g)p(A)(0;7r)(t;2) 

for all (x; n) E G I S(^n) and all Vi G Vi and V2 G V2. These representations {p(n) ® P(a)} of 
G I S^n) may now be used to induce representations of G I Sn- More importantly, we have 
the following, which is Theorem 4.4.3 of James and Kerber (1981). 

Lemma 3.4.1. The collection of representations of G I Sn induced by the collection 
{P{n) ® P(A)} of representations of G I S(^n) is a complete collection of pairwise inequivalent 
and irreducible representations of G I Sn if (n) = {ni,n2, . . . ,ns) ranges over all different 
types and, for fixed type (n), (A) = ([Ai], [A2], • . • , [A^]) ranges over all ordered s-tuples of 
partitions of (rii, n2, . . . , Ug), respectively. 

Since the number of irreducible representations equals the number of conjugacy classes 
(Proposition p.3.1|) , it follows from the results in Section p.3| that the number of irreducible 



representations oi G I Sn is equal to 

^ p{ni) ■ p{n2) ■ ■ -pins), 

{ni,n2,...,ns) 

where p{k) is the number of partitions of k (with p{0) := 1) and the sum is taken over all 
s-dimensional vectors such that rii + n2 + ■ ■ ■ + rig = n and Uj > for all 1 < J < s. This is 
Corollary 4.4.4 of James and Kerber (1981) and is consistent with the results found above. 

3.5. Irreducible Characters. For the elements in the support of the probability mea- 
sure defined in (|3.1.1| ), we now determine the characters of the irreducible representations of 
the complete monomial group G I Sn induced by the irreducible representations of G I S'(„) 
found in Lemma 3.4.1 . We do this with the aid of Lemma 2.4.3| . The Frobenius character 



formula, which is Theorem V.4.1 of Simon (1996) or Theorem 12 of Serre (1977), may also 
be used. 

For notational purposes, let Cf '"^ (with 2<k<s)he the conjugacy classes of {u; e) in 
G I Sn, with k chosen so that the single non-identity entry of u is in conjugacy class Gk of 
G, and let '"^ (with 1 < A; < s) be the conjugacy classes of {v; t) in G I Sn, with k chosen 
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so that the product (calculated as in the determination of the type matrices in Section |3.3| ) 
of the two possible non-identity entries of v is in conjugacy class Ck oi G. 

Lemma 3.5.1. For the elements in the support of the probability measure defined in 



( \3.1.1\ ), the character of the irreducible representation p of the complete monomial group 
G I Sn induced by the irreducible representation p(^n) ® P(a) of G I Si^n) is given by 

n 

ni, . . . ,ns 



Xp(e; e) = 



d 



pi 



Xp{V\T) 



s 

dpY. 



Xpj {9k) 



for {u-e)ECf-^^\ 



nj{nj — 1^ 
n{n — 1) 



Xpj{9k) 



d 



Pi 



r(A,) 



for {v;T)eGt\ 



where, for 1 < j < s, Xpj is the character of the irreducible representation pj of G and X[\j] is 
the character of the irreducible representation p[\.] of Sn^, where r{\j) := XlXj]{'T) / d-ix^] with 
transposition t E Sn-, and where Qk is any element of the conjugacy class Gk of G. 



Proof. Let G := G l Sn and H := G I S(^„y, we apply Lemma p.4.3| to these groups. 
Notice that |G| = \G I 5„,| = IG]'^ ■ n\ and that \H\ = \G I = \G I x ■ ■ ■ 5„J| = 

|G|"-ni!---nJ. Thus 

\G\ ICr -nl _ / n 

\ni,...,ns 



\H\ \G\^-n,\ 



Recall, in the representation p(„) ® p(^x) oi H = G I S{n)^ that p(„) is the n-fold tensor 
product 

p« ®p(2) ...®pH, 

where each p*^*^ G {pi, p2, . . . , ps] is an irreducible representation of G and the first ni factors 
are pi, the next n2 factors are p2, etc. Also recall that p(A) is 

P[Ai] ®P[\2\ ® •■■®P[A,], 

where P[Ai], P[A2]) • • • > P[\s\ irreducible representations of S'„^, S'„2, • • • , >5'„^, respectively. It 



then follows from Lemma |2.4.2| that the character of the representation p(„) ® p(^x) is given 
by 



Xh = X^^^ ■■ - X^"^ • X[Ai] ■■■X[\s]- 
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We begin with the identity element (e; e) G G = G I S'„. Since the identity element forms 
a singleton conjugacy class, we have i = 1. Furthermore, since the only element of H to 
which it is conjugate in G is the identity element (e; e, . . . , e) G = G I S(^n), we have t = 1 
and ki = 1. 

Recall that the character at the identity of any representation is the dimension of the 
representation. Thus 

x^'\e)---x'^'^\e) = d;i---d;:. 

So the character of the identity element {e;e,...,e)EH = G I S'(„) is 

t^P^ ■■■c^p: ■X[Ai](e)---X[A.](e). 



Therefore, it follows from Lemma |2.4.3| that the character at the identity element in the 



induced irreducible representation p of G I S'„ is 



n 



Xp(e;e)= ( ^ ] ^T, ■ ■ ■ d^l ■ X[x,]{e) ■ ■ ■ X[Xs 



and, furthermore, that the dimension of the induced irreducible representation p of G I Sn 
is 

n 

.rii, ...,ns 



We now consider the elements (m; e) G G = G I Sn with m 7^ e , which comprise s — 1 
conjugacy classes: G2^'^\ C^''^\ . . . , Gs^''^\ where the indices are chosen so that, for G'^'^\ the 
single non-identity entry of u is in conjugacy class Ck of G. For a particular conjugacy class 
C^'^\ since there are n\Ck\ elements in G, we have d = n\Ck\- Each of the s — 1 conjugacy 
classes splits into s classes in H: in the first class, the only non-identity entry of u is one of 
its first ni entries; in the second class, the only non-identity entry of u is one of its next n2 
entries; etc. Thus t = s with kj = nj\Ck\ for all 1 < j < s. 

For a particular conjugacy class Ck of G, let Qk be a representative element. Then the 
characters of the s classes of elements (m; e, . . . , e) G if = G I S(^n) are 

Xp, {9k) 



Pi 



dn 



"Pj 

for 1 < j < s. 



Therefore, it follows from Lemma |2.4.3| that the character at (m; e) of the induced irre- 
ducible representation p of G ? S'„ is 



n 



Xp, {9k) 



/ j=i 
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for 2 < k < s, from which the desired result follows. 

Finally, we consider the elements {v;t)eG = G I Sn, which comprise s conjugacy 
classes: c[^''^\ C2''^\ • • • , ci^''^\ where the indices are chosen so that, for Cj^''^\ the product 
(calculated as in the determination of the type matrices in Section p.3| ) of the two possible 



k\ 



non-identity entries of v is in conjugacy class Ck of G. Since there are ^n{n — 1) ■ \G\ ■ \Gi 
elements in the conjugacy class Gj^''^'^ of G, we have i = ^n{n — 1) ■ |G| ■ \Gk\- 

Each of the s conjugacy classes splits into s classes in H: in the first class, r transposes 
two of the first rii elements leaving other elements fixed; in the second class, r transposes two 
of the next n2 elements leaving the other elements fixed; etc. Also, in the first class, the only 
non-identity entries of v are in its first rii entries; in the second class, the only non-identity 
entries of v are in its next n2 entries; etc. Thus t = s with kj = \nj{nj — 1) ■ |G| • \Gk\ for all 
1 < J < s. 

For a particular conjugacy class Gk of G, let be a representative element. Then the 
characters of the s classes of elements {v;t) & H = G I S(^n) are 

Xpjidk) X[Xj]{r) 



V. ^P. -^[A,---"[A. ^^^^ 



for 1 < J < s. 



Therefore, it follows from Lemma |2.4.3| that the character at (v\ t) of the induced irre 



ducible representation p of G I Sn is 

n \ ^ rijjnj - 1) ^ ^ Xp,{9k) X[x,]{r) 

j=i - y- ^ d\x^ 

for 1 < A; < s, from which the desired result follows. □ 



3.6. Analysis of the Independent Shuffles Random Walk. In order to continue 
our analysis of the independent shuffles random walk introduced in Section |3]l|, we must 
now calculate the Fourier transform of P at each irreducible representation of the complete 
monomial group G I Sn- 



Lemma 3.6.1. Let P he the probability measure on G I Sn defined in ( ^.1.1\ ). For the 



irreducible representation p of G I Sn induced by the representation p(^n) ® P(x) of G I S(n), 
the Fourier transform is 



Pip) 



ni ni{ni - 1) 

— + 2 ^(^1 



where {n) = {ni, . . . ,ns), (A) = ([Ai], . . . , [A^]), and r(Ai) = X[Ai](t)/c?[Ai] with transposition 
T e Sn^. 
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Proof. Recall from Lemma [3. 3. 1| that P is constant on conjugacy classes. It then follows 



from Lemma |2.5.1| that P{p) = C ■ I, where C is a constant. By applying the results from 
Corollary p.3.2| and Lemma |3.5.1| , we find that 



C 



\G\n 



k=2 



n / do 



n{n — 1) 



■\G\-n\T'-^ 



nj{nj-l) Xp,{9k) 



2 ' ' ' nin-1) d,^ 



1 ^ s s 



\G\n 



IGln^ ^ do- 

j=l Pj k=2 



+ 



1 

'U2 ^ 



\G\n 



d 



Pj 



k=l 



When j = 1, since pj is the trivial representation of G with dp. = 1, we have \Ck 



k=l 



XpM) = Yl l^fcl = 1^1 Yl l^fcl ■ ^P^^9k) = \G\ - \Ci\ = \G\ - 1. When 2 < j < s, 



k=l 



k=2 



Xpjigi) = -dpr Thus 



k=l 



it follows from Proposition |2.4.4| that \Ck\ ■ Xpjidk) = and \ Ck\ ■ XpjiOk) = —\Ci 

k=2 

n^in, - l)ri\,)\G\ 



C 



\G\n \G\n^ 
111 ni(ni — 1 



n,i\G\-l) ^ in-n,)i-l) 



\G\n^ 



\G\n^ 



r(Ai). □ 



By applying the results from Lemmas |3.5.1| and |3.6.1| to Lemma p.6.1| , we determine all 
the eigenvalues of the transition matrix P induced by the probability measure P, together 
with their multiplicities. 



Corollary 3.6.2. Let P be the probability measure on G I Sn defined in ( \3.1.1\ ). Let 
P be the transition matrix of the Markov chain induced by the probability measure P. Then, 
for the irreducible representation p of G I Sn induced by the representation p(^n) ® P{x) of 
G I S(n), there is an eigenvalue Hp of P occurring with algebraic multiplicity 

\ ^ 
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such that 
where {n) 



[rii, . . . ,ns), (A) = ([Ai], . . . , [A^]), and r(Ai) = X[Ai](i")/rf[Ai] with transposition 



We have now established the resuhs necessary to prove Theorem ^.1.3| . 



Proof of Theorem |3.1.3| . By applying the results from Lemmas p.5.1| and p.6.1| to 
the Upper Bound Lemma ( |2.6.2| ), we find that 



I p*fc rrllS ^ 1 
l-T — U IItv ^ 4 



< \{\Grn\)\\P*''-U\\l = \Y,d^ 

p 



ni ni{ni — 1) 



+ 



r(Ai 



2k 



where the sum is taken over all nontrivial irreducible representations of G I Sn- Thus we 
have 



\P*^-U\\\^ < -A\G\''n\)\\P*^ -U\\l 



(n) (A) 



n 



ni, . . . 



pi Ps [AiJ [As 



ni ni{ni — 1) 



-I 2k 



+ 



n / \ 2 

n 



ni=0 ^ [Ai] 



Hi ni{ni - i; 



+ 



■r(Ai 



2k 



r(A, 



(n2,...,n.) ^ ^' ' ([A2],...,[As]) 



Consider the direct product Sn^ x ■ ■ ■ x Sn^ ■ It follows from Proposition |2.3.5| and Lemma 



2.3.2 that the sum of the squares of the dimensions of all the irreducible representations of 



X ■ ■ ■ X S^g is 



C^fAa] ■ ■ '^f\s] ~ l'^"2 X ■ ■ ■ X SnJ — ■ ■ -n^!. 

{[A2],...,[A.]) 



Using the multinomial theorem and Lemma 2.3.2, we also have 



E 

(n2,.--,«s) 



n — ni 
n2, . . . ,n, 



= « + ■■■+<) 



,2 \ra-ni 



Thus 
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(n2,...,n,) V 2' • • • ' sj ([A2],...,[A,]) 

which combines with the results above to give 

\\P-''-V\\w < U\G\"n\)\\P-''-U\g 



2k 



(3.6.3) 



ni=0 ^ [Ai^ 

where the inner sum is taken over all partitions [Ai] of rii. The trivial representation of 
G I Sn, which should not be included in the summations above, occurs when rii = n and 
[Ai] = [n] (which gives the trivial representation of Sn)- 



For each 1 < ni < ra, it follows from (|2.7.2|) that we may bound the inner sum above, with 
[Ai] = [rii] excluded, by 



[Al] 



rii ni^rii — 1) 



r(Ai 



2k 



n J 



4k 



[Al] 



— + r Al, 

Til rii 



2k 



< 



rii 



n 



4k 



2 -4c 



for a universal constant a > 0, when k > ^rii log^i + crii. Since n > rii and |G| > 2, this is 
also true when k > |nlogn + ^nlogdCI — 1) + cn. 

We must also bound the term for the trivial representation [Ai] = [rii] ioi 1 < rii < n — 1. 
Since in these cases, rf?^ , = 1 and r(Ai) = 1, we have 



1A 



2k 



rii ni{ni — 1) 



— + 



2k 



rii 



n 



4k 



These results lead to the upper bound 

\\p*k _ f/||2^ < 1 (Id^n!) \\P*^ - U\\ 



HE(:)^(i«i-i)-E4. 

ni=0 ^ ^ [Al] 



Hi ni{ni — 1) 



rii 



n 



— + 



4k 



r(Ai 



2k 



ME(:)^(i«i-ir-(f 

ni=l ^ ' 



4k 
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Now notice that, when k = |nlogn + irzlogdCI — 1) + cn, then 



n ) 



'^^N -n[-21og(n)-log(|Ghl)-4c] 

n J 

which combines with the results above to give 



-4c 



-n log(ni /n) 



l-P*' - Clllv < 7 (|G|"n!) IIP*'' - U\\l 



n\=l 

n-l 



n \ n\ 



{\G\ - 1)" 



-4c 



n\=\ 



* \ nW ni! 



- l)n2 



-4c 



(IGI-IKJ 



-nlog(ni/n) 



-nlog(ni/n) 



If we let 2 = n — ni, it follows that 



\P*^ - f/|||v < \ (IGTri!) ||P*^ - f/||2 



i=0 i=0 ^ ^ 



<a2e-^'=exp(e-^'=) + 
Since c > 0, we have exp(e~^'^) < e. Therefore 

l|P*'-f/|lTv < idcrm: 

from which the desired result follows. □ 



-4c 



IP 



*fe 



u\ 



exp (e" 

< 



4c^ 



-4c 



Theorem |3.1.3| shows that k = |nlogn + ^nlogdCI — 1) + cn steps are sufficient for the 



(normalized) i"^ distance, and hence also the total variation distance, to become small. A lower 
bound in the (normalized) i"^ metric can also be derived by examining ^^(|G| — 1) (l — ^)'^'' , 
which is the dominant contribution to the summation ( |3.6.3| ) from the proof of Theorem p.l.3| . 
This term corresponds to the choice rii = n — 1 with [Ai] = — 1]. Notice that k = 
^nlogn + ^nlogdCI — 1) — cn steps are necessary for just this term to become small. 

That k = |nlogn — cn steps are necessary for the total variation distance to become 
small follows directly from Theorem 2.7.3 , as follows. Recall that Theorem 2.7.3| shows that 



k = ^nlogn — cn steps are necessary for the total variation distance to uniformity to become 
small for a random walk generated by random transpositions from the symmetric group Sn- 



This is exactly the random walk on G I Sn introduced in Section |3.1| , if the vector x from 
{x; it) & G I Sn is ignored. Thus Theorem 2.7.3| provides a lower bound on the distance to 
uniformity in the total variation metric, and hence also in the (normalized) i'^ metric. 



RANDOM WALKS ON WREATH PRODUCTS OF GROUPS 



31 



In the special case of the hyperoctahedral group Z2 I S'„, we have \G\ = 2. Thus the upper 
bound in Theorem |3.1.3| matches the lower bound derived from Theorem p.7.3 . 

That k = ^nlogn + cn steps are also sufficient for the total variation distance to become 
small is a result of the following. 

Theorem 3.6.4. Let P and U he the probability measures on the complete monomial 
group G I Sn defined in ^3.1.^ ) and ^3.1.^ ), respectively. Let k = ^nlogn + cn. Then there 



exists a universal constant b > such that 

\\P*''-U\\ty < he-^" for all 00. 

Proof. The probability measure P on G I Sn induces probability measures Q on 
and R on Sn by defining 

Q(f) := ^P(f;7r) and P(7r) := ^P(f;7r). 

7r65„ xeG" 



Notice that R is the probability measure on Sn defined in (|2.1.1 



Recall that P is the one-step distribution for a random walk W = (PVo; W^i? W^2, • • •) on 



G I Sn, as described in Section Define stochastic processes X = (Xq, Xi, X2, . . .), with 
state space G", and Y = (Yq, ^i, ^2, ■ ■ ■), with state space Sn, by setting Wk =: (X^; Y^) for 
A; > 0. It is not hard to see that X (resp., Y) is a random walk on G" (resp., on Sn) with 
one-step distribution Q (resp., R). 

For each 1 < i < n, let Tj be the step index k for the random walk W at which the 
element Xi E x is first multiplied by a uniformly chosen random element of G, as the result 
of an element either of the form {u; e) or of the form {v; r). Define T := max Tj. Thus T is 

l<i<n 

the step at which the last element of x is randomized, and 



Notice that 



F{T>k} < J2^{T,>k} = < 

P*^{x- vr) > P {Wk = (f ; tt), T < k} 
^ -P{n = 7r,T< A;} 



G| 

\r-^[R*\7i) - P{yfc = 7r,T>A;}]. 



IG 



Thus 
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-)*k 



f/||Tv = m^x[U{A)-P*\A)] 



< 



< 



max 

ACG 



max 

ACG 



E 
E 



1 



{x;tt)&A 



R 



*k I 



+ 



\G 



E 

(x;7r)eG i Sn 



\G\ 



;F{Yk = n,T>k} 



\R* 



UsJtv + P{T>fc}. 



Let k = log n + cn. It follows from Theorem |2.1.3| that there exists a universal constant 
a > such that \\R*''-UsJ TY ^ CLG ^ for all c > 0, where Us,^ is the uniform distribution 
on 5*^ as defined in (|2.1.2|) . Furthermore, it follows from above that F {T > k} < e^'^^. 
Therefore 



\\P - f/||TV 

from which the desired result follows. □ 



< 



(a + l)e 



-2c 



Notice that, for the independent shuffles random walk, the rate of convergence to uni- 
formity for the (normalized) £^ distance is slightly slower (due to the addition of the term 
\n log(|G| — 1)) than that for the total variation distance. However, if |G| is moderate relative 
to n, these rates of convergence are "essentially" the same. 

The following table summarizes the number of steps (both necessary and sufficient) for the 
distance (both normalized £^ and total variation) to uniformity to become small for various 
special cases of the independent shuffles random walk analyzed in this section. 
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Random walk on G I Sn 
(with independent randomizations) 


G 


metric 


nec. or suff. 


number of steps 


proof 


Z2 




sufficient 


in log n 


Thm. 


necessary 


|n log n 


pf. of Thm. 


3.1.8 


TV 


sufficient 


in log n 


Thm. 


3.1.3 


necessary 


in log n 


Thm. 


2.7.3 




e 


sufficient 


in log n + in log(m - 


-1) 


Thm. 


3.1.3 


necessary 


in log n + in log(m - 


-1) 


pf. of Thm. 


3.1.3 


TV 


sufficient 


in log n 


Thm. 


3.6.4 




necessary 


in log n 


Thm. 


2.7.3 




e 


sufficient 


in logn + inlog( m 


1-1) 


Thm. 


3.1.3 


necessary 


in log n + inlog( m 


1-1) 


pf. of Thm. 


3.1.3 


TV 


sufficient 


in log n 


Thm. 


3.6.4 




necessary 


in log n 


Thm. 


2.7.3 


G 

abelian 


e 


sufficient 


inlogn + inlog( G 


-1) 


Thm. 


3.1.3 


necessary 


in log n + inlog( G 


-1) 


pf. of Thm. 


3.1.3 


TV 


sufficient 


in log n 


Thm. 


3.6.4 




necessary 


in log n 


Thm. 


2.7.3 


G 

nonabelian 


e 


sufficient 


inlogn + inlog( G 


-1) 


Thm. 


3.1.3 


necessary 


in logn + inlog( G 


-1) 


pf. of Thm. 


3.1.3 


TV 


sufficient 


in log n 


Thm. i.^A 




necessary 


in log n 


Thm. 


2.7.3 
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3.7. Analysis of the Paired Shuffles Random Walk. We now describe a slight vari- 



ant of the independent shuffles random walk introduced in Section |3.1| . This will provide 
a second benchmark random walk, with known rate of convergence, on G I Sn for use in 
the comparison technique. Schoolfield (1998) analyzed a random walk for which compar- 
isons to the independent shuffles and paired shuffles random walks, in the special case of the 
hyperoctahedral group Z2 I Sn, gave different bounds. 

Imagine n decks of cards, labeled 1 through n, in sequential order, each with its m cards 
in sequential order. Independently choose two integers p and q, uniformly from {1,2, ... ,n}. 

If j9 7^ g, transpose the decks in positions p and q. Then, independently of the choice of 
p and q and uniformly (i.e., with probability = ~t each), permute the deck terminating 
in position p by a permutation tt G G = Sm and permute the deck terminating in position 
q by 71^^ E G = Sm- Notice that the only elements of the form {v; r) that occur in this 
combination of operations are from the single conjugacy class C[^''^\ The probability that 
an element from any of the the other s — 1 conjugacy classes occurs now vanishes. 

If p = q (which occurs with probability 1/n), leave the decks in their current positions. 
Then, again independently and uniformly, permute the deck in position p = g by a per- 
mutation in G = Sm- The probabilities of the identity and of the elements {u; e) are thus 
unchanged from the independent shuffles random walk. 

Again, as in Section |3.1| , we will actually examine a random walk on G I Sn for any 
group G, not just the symmetric group Sm- In this more general case, the example above 
is equivalent to beginning with a vector (e, . . . , e) G G^, where each e G G is the identity 
element. Two elements of this vector are then transposed as the decks of cards were above. 
The transposed elements of this vector are then multiplied by elements of G as the individual 
decks were permuted above. 

We refer to the process on G I Sn described above as the paired shujfles random walk, again 



retaining use of the word "shuffles" even when G is not necessarily Sm- As in Section qA. 



the paired spins and paired flips random walks are defined in an analogous manner in the 
special cases of the generalized symmetric group I Sn and the hyperoctahedral group 
Z2 I Sn, respectively. 

The paired shuffles random walk may be modeled formally by a probability measure Q on 
the complete monomial group G I Sn- We may thus define the following probability measure 
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on the set of all elements of G I 5'„: 



Q(e; e) 
Q{u] e) 
Q{v-t) 



1 
1 
2 



where u ^ e E G", 
where {v\t) G Cj^'^"*, 



(3.7.1) 



|G|n2 

tt) =0 otherwise, 

where there is only one non-identity entry of m G G", and where if r G is the transposition 
{p q) then the only possible non-identity entries of ■u G are in positions p and q and these 
entries are mutually inverse elements of G. In the special case of the hyperoctahedral group 
Z2 I Sn, the conjugacy class g[^'^^ is the even transpositions. 

In order to continue our analysis of the paired shuffles random walk, we must now calculate 
the Fourier transform of Q at each irreducible representation of the complete monomial group 

G I Sn- 



Lemma 3.7.2. Let Q be the probability measure on G I Sn defined in 7.1\) . For the 
irreducible representation p of G I Sn induced by the representation p(n) ® P(\) of G I S(n), 
the Fourier transform is 



Q{p) 



where (n) = (rii, . . . .Ug), (A) = ([Ai], . . . , [A^]), and r{\j) = X[Aj](t)/'^[Aj] with transposition 
T e Sn,- 



Proof. Notice that Q is constant on the conjugacy classes of G I Sn- It then follows 



from Lemma 2.5.1 that Q{p) = G ■ I, where G is a constant. By applying the results from 
Corollary p.3.2 and Lemma |3.5.1 , we find that 
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' ' k=2 ' ' j=l 

2 n(n - 1) s^ nj{nj-l) 



n(n — 1) 



\G\n \G\n^j~ld 



k=2 j=l 



1 ni{\G\-l) {n-ni){-l) ^ nj{nj - 1 



\G\n \G\n^ IGlnS ^ 

77-1 — 1 



By applying the results from Lemmas p.5.1| and p.7.2| to Lemma p.6.1| , we determine all 



the eigenvalues of the transition matrix Q induced by the probability measure Q, together 
with their multiplicities. 

Corollary 3.7.3. Let Q be the probability measure on G I Sn defined in l \3.7.1\ ). Let 
Q be the transition matrix of the Markov chain induced by the probability measure Q. Then, 
for the irreducible representation p of G I Sn induced by the representation p(^n) ® P{\) of 
G I S(^n), there is an eigenvalue 7Cp of Q occurring with algebraic multiplicity 

2 



n 

ni,...,ns 



such that 



" ^2+2^ ^2 n^,;, 



where (n) = (rii, . . . ,ns), (A) = ([Ai], . . . , [A^]), and r{\j) = ](r)/(i[A ] with transposition 

The following result establishes an upper bound on both the total variation distance and 
the £^ distance between Q*^ and the uniform distribution U on G I Sn- The total variation 



upper bound is rather poor when G is nonabelian, as shown by Theorem |3.7.6| . The quality 
of the £^ upper bound will be discussed following the proof of the theorem. 
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Theorem 3.7.4. Let Q and U be the probability measures on the complete monomial 
group G I Sn defined in ^3. 1. \ ) and ^3.1.1\ ), respectively. Let 

k = max |n logri + logdGI — 1) + |nlog(s — 1), log5„} + 2cn, 

s 

where Sn '■= ^^'^p" '^'^^ dp. is the dimension of the irreducible representation pj of G for 
i=2 

2 < J < s. Then there exists a universal constant b > such that 

\\Q*^~U\\Ty < H|G'rn!)'/'||g*^'-f/||2 < be~^' for all OO. 



Proof. By applying the results from Lemmas |3.5.1| and |3.7.2| to the Upper Bound 
Lemma (|2.6.2| ), we find that 



\\Q*'-Ur^v < \{\Grn\)\\Q*'-U\\l = \Y,dl 

p 



rii , ^Ujirij-l^ 



.r(A, 



2k 



(n) (A) 



n 



ni 



2k 



(3.7.5) 

where the sums are taken over all nontrivial irreducible representations of G I Sn- Notice 
that 



ni 



-^^2 '"(^i) 



2k 



ni ni(ni — 1) 



+ 



r(Ai 



- 1) 

J=2 



2k 



ni\' 
n ) 



1 ni - 1 

\ r(Ai 

77-1 ni 



2k 



j=2 1 



< max 



n ) 



2k 



— + r Ai, 

ni ni 



2k 



max 

2<j<s \ n 



2k 



^3 - 1 

n~ 



2k' 



< 



rii 



n 



2k 



\ r(Ai 

Til ni 



-I 2k 



+ E 



i=2 



n 



2k 



-I 2k 



2k 



where the first inequality is due to the fact that j ctj^j 1 ^ max x'^'^ whenever each 

\i=i / 

a.j > and ai + ■ ■ ■ + = 1. As noted in the proof of Theorem 5 in Section D of Chapter 3 
of Diaconis (1988), to every representation (A^) there corresponds a conjugate representation 
(A^) such that r(Aj) = — r(Aj). So we have 
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2k 



[A,]:r(Aj)>0 



n 2A: 



Thus 



E 



r(A,) 



n 2k 



S2 E 

[A,]:r(A,)>0 



2k 



£2E 



1 rii — 1 
— + — 



r(A,) 



2k 



which combines with the previous results to give 
\\Q*'-Ur^y < li\Grn\)\\Q*''-U\\l 



< 



^EE 

(n) (A) 



n 

ni, . . . 



i s 

d'lT ■ ■ ■ ^IT ■ ^[Ai] ■ ■ ■ 4a.] 5Z 



2k 



1 rij — I 

— + — — 



2k 



+ iE<'( 



n — 1 



2fc 



where the sum over (A) = ([Ai], . . . , [A^]) is taken over all partitions of (ni, . . . ,ns), except 
that we omit the trivial partitions [A^] = [n] for 1 < j < s. The final term reintroduces the 
appropriate terms for [Xj] = [n] for 2 < j < s. 

Continuing as in the proof of Theorem p.l.3| , this may be simplified to 

\\Q*'-U\\ly < H\Grn\)\\Q*'-U\\l 



j=l nj=0 ^ ■> 



^ ^EEC lT(N-4r"'E4.,e 



[\j]\-nj 



n- X 2k 
n 



1 rio — 1 , ^ , 
- + r{Xj) 



2k 



+ IE«'( 

j=2 ^ 



n — 1 



n 



2k 



ni=0 



^ ^E(:j^(i«i-i)"-"'E4.,e 



2k 



[\j]\-nj 



1 — 1 
— + 



r(A,) 



2k 
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where, for 1 < J < s, the sum is taken over all partitions [Xj] of rij, with [Xj] = [n] 

[\j]\-nj 

s 

excluded when Uj = n, and where 5„ = c?^". 

i=2 

Recall from the proof of Theorem p.l.3| that, when k > \ogn + log {\G\ — 1) + c'n, 
we may bound the inner sum above using 



[Ai] 



n ) 



2k 



1 ni-l 

111 Til 



r(Ai 



2k 



n 



2k 



2k 



ni 
n 



2k 



\ '^(Ai 

rii rii 



2k 



for 1 < rii < n — 1, and using 

[Ai] 

for rii = n. So this is also true when k > n\ogn + inlogdGI — 1) + \n\og{s — 1) + 2cn, in 
which case we may choose c' = \ \ogn + \ logdCj — 1) + | log(s — 1) + 2c. Thus 

-8c 

-Ac' ^ 



(|G| 1)2^2 

Now notice that when k > n\ogn + InlogdCj — 1) + |nlog(s — 1) + 2cn, 



rii 



2k 



— ) < 

n 



-4c 



-nlog(ni/ra) 



L(|G|-i)(.-iKj 

These results lead to the upper bound 

W-U\\\^ < \{\G\-n})W-U\\l 



< 2sa 



-8c 



E 



ni=l 



n \ nl 



{\G\ - \r 



-Til 



-Ac 



ni=l 



n—ni 



-Ac 



-Ac 



L(|G|-l)(s-l)n2j 



L(|G|-I)(s-1K 



L(|G|-I)(s-1K 

—n log(ni/n) 



-n log(ni/n) 



Continuing as in the proof of Theorem p.l.3| , we find that 
\\Q*'-Ur^y<\{\Grn\)\\Q*'^-U\\l 



(|G|-l)(s-l)2n2 



e-^^exp (e"^") + | [^] e'^^exp (e"^") 
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Since c > 0, we have exp(e < e. It then follows that 
\\Q*'-U\\ly<li\Grn\)\\Q*'^-U\\l 

< [{4a' + 1) e] e-'^ + 



_4(|G| -l)(s-l)n2_ 

s 

Recall that when G is abelian, 6n = ^^"^p" = l*^! ^ 1 fo^ this case, the proof is 

complete. 

When G is nonabelian, 5„ increases exponentially with n. We must thus reexamine the 
term j 6n I I • Notice that when k > log5„ + 2cn, 



n 



Therefore, choosing k = max {ra logra + |?t, logdCj — 1) + log(s — 1), ^nlog5„} + 2cn 
completes the proof. □ 

Theorem |3.7.4| shows that k = max ^^n \ogn+^n \og{\G\ — l) + jn log(s — 1), |nlog5„|+2c?T, 
steps are sufficient for the (normalized) i"^ distance, and hence the total variation distance, 
to become small. When ?T,logn + |nlog(|G| — 1) + |nlog(s — 1) > ^n\og6n (which is always 
the case when G is abelian), a lower bound in the (normalized) i'^ metric can also be derived 
by examining 

which, in this case, is the dominant contribution to the summation ( |3.7.5| ) from the proof 
of Theorem |3.7.4| . This comes from summing (over 2 < j < s) the terms corresponding 
to the choice rii = n — 1 with [Ai] = [n — 1] and nj = 1 with [Xj] = [1]. Notice that 
k = ^nlogn + i^logdCI — 1) — cn steps are necessary for just this term to become small. 

Notice that, in this case, since s < \G\, our upper [nlogn+lnlogdCj — l) + |nlog(s— 1)] and 
lower [|nlogn+ ^nlogdCI — 1)] bounds on the number of steps required for the (normalized) 
i'^ distance to become small differ by at most a constant factor, but we have not been able 
to close this gap. However, this gap will pose no problems in the implementation of the 
comparison technique, since its results are accurate only up to a constant factor. Nonetheless, 
no such gap exists for total variation distance, as will be shown later. 

When nlogn+ ^nlogdCI — 1) + |nlog(s — 1) < ^n\og6n, a matching lower bound in the 
(normalized) £^ metric can also be derived by examining 
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\ n 

which, in this case, is the dominant contribution to the summation ( |3.7.5| ) from the proof of 
Theorem ^■7.4| . This comes from summing (over 2 < j < s) the terms corresponding to the 
choice Hj = n with = [n]. Notice that k = |nlog5„ — cn steps are necessary for just this 
term to become small. 

For fixed nonabelian G, notice that d"^^^ < < {s — l)d'^^^, where dmax := max dp.. So 

2<j<s 

2?7,log(imax < log 5n < 2?7. log (imax + log(s — 1). Thus, in this case, we have thereby deter- 
mined that very nearly log (imax steps are necessary and sufficient to make (normalized) 

s 

distance small for a fixed nonabelian group G. How large is dyaa.x'^ Since d'^p. = \G\ — 1, 
somewhat crude bounds are 



s - 1 



< Cimax < (IGl-l 



,1/2 



For G = Sm with m > 3, for example, these bounds are sufficient to show log (imax ('ti) = 
|mlogm — |m — 0{m^/'^) as m — > oo, since \G\ = ml and s = p{m) ~ ^ exp l^y^^}' 
where the asymptotic formula for the partition function p(-) is due to Hardy and Ramanujan 
(1918) (see, e.g.. Hall (1986), Section 4.2). 

That k = |nlogn — cn steps are necessary for total variation distance to become small 
again follows directly from Theorem ^.7.3| , exactly as in Section That k = |nlog?T, + cn 
steps are also sufficient (at least in continuous time) for total variation distance to become 
small is the result of Theorem |3.7.6| , which follows. 

All of the random walks studied thus far have been discrete-time random walks. We now 
introduce the continuous-time analogue of a discrete time random walk. Changing from 



discrete to continuous time will be advantageous in the proof of Theorem 3.7.6 



Suppose that P is a probability measure defined on a finite group G. The continuized 
chain corresponding to P is the continuous-time Markov chain on G started at the identity 
e with transition rates 

p{g,h) = P{hg-') 

for g,h E G with g ^ h. We denote the distribution of the chain at time t by P^, which is 
given by 

Pt{g) := Y.^-'y_P*\9) ioigeG. 

k=0 
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The following result shows that time t„ = ^nlogn + cn is sufficient, as n — > oo, for the 
total variation distance to become small in the (continuous-time analogue of the) paired 
shuffies random walk. The result is established only in the limit because the proof relies on 
classical random graph results known only (at least to us) in the limit. 

Theorem 3.7.6. Let Q and U he the probability measures on the complete monomial 
group G I Sn defined in l \3. 7. ]\ ) and (\3.1.2J , respectively. Let Qt be the distribution at time 
t of the continuized chain corresponding to Q. Let tn = ^nlogn + cn. Then there exists a 
universal constant b > such that 

limsup \\Qt„ - U\\ty < be'" for all c> 0. 

n >oo 

Proof. The probability measure Q on G I Sn induces a probability measure R on Sn 
by defining 



i?(7r) := ^(^'^)- 



Notice that R is the probability measure on Sn defined in (|2.1.1|) . 



In order for the paired shuffies continuized chain to achieve randomness, not only must tt 
in (x; vr) be a random permutation of Sn, but also must the entries of x G G" be (uniformly) 
random elements of G. Recall that the values of the entries in positions p and g of a; are 
multiplied (on the left) by mutually inverse elements of G when x is multiplied (on the 
left) by {v]t) G C[^'^^ with r = {p q). In order to determine the amount of time needed 
to randomize the entries of x, begin with n labeled vertices. At each time that an element 
{v; {p q)) G G'^i"'^'' is generated by Q, consider an edge to be generated between positions p 
and g in a graph F. Let T be the time at which the graph F becomes connected. 

Let T* be the ffist time t > T at which Q generates an element either of the form (m; e) 
or (e; e). So we may suppose that at time T*, an element OC 'i ^ X IS multiplied (on the left) 
by a uniformly chosen random element g E G. Thus Xi is now randomized. Since T* > T, 
it follows that there exist elements Xj^, . . . ,Xj^ G x, with ji, . . . ,jm 7^ i and m > 1, such 
that, for 1 < £ < m, at some time tj^, < T, the entry Xj^ was multiplied (on the left) by a 
uniformly chosen random element hi G G and Xi was multiplied by hj^ G G. But since x, 
is now randomized, it follows that Xj^, . . . ,Xj^ are also. By then considering the elements 
"paired" with Xj^, . . . , Xj^, and so forth, this argument continues on to show that at time T* 
all of the elements of x are randomized. 

Since elements either of the form {u; e) or (e; e) are generated by Q at exponential rate 
A = 1/n, we have 
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f>{T*-T<s} = 1 - e""/". 
It then follows from the independence of T and T* — T that 
F{T* >t}= [ ie""/" ¥{T>t-s} ds 

Js=0 " 



POO 

/ e-^'FiT >t-un}I{u<t/n} du. 

Ju=0 



Since each element of the form (x; (pq)) G C[^''^\ which transposes a particular pair of entries 
{p,q}, is generated by Q at exponential rate A = the indicator (call it J{p_g}(t)) of the 

presence of any given edge {p, q} in F at time t has expectation 



Moreover (and this is the advantage of working in continuous time), the stochastic processes 
I{p,q}{-) are mutually independent. Let t = tn = |nlogn + cn. Then 

1 — e"^*''"^ = 1 — exp |— ri^"'^(logn + 2c)} ~ n~"'^(logn + 2c) as n — > oo 

for fixed c G M. It then follows from a classical random graph result of Erdos and Renyi 
(1959) (see, e.g., Graham et al (1995), Chapter 6, Section 5), that 

¥{T>t-un} = F {T > ^nhgn + {c - u)n} — > 1 - exp {-e"^^^"")} as n ^ oo 
for fixed c, m G M. Thus, by the bounded convergence theorem, 

POO 

lim P{r*>t}= / e"" [1 - exp{-e~2(^""Ul du 

Ju=0 

PC POO 

< / e-"e-2(<^-") rfw + / e-'^du 

Ju=0 



g_2c _^ g-c < 26-" 



for fixed c G 



It follows exactly as in the proof of Theorem p.6.3| that 

\\Qt-U\\Ty < \\Rt ~ Us Jtv + F{T* >t} 

for every t > 0. With t„ = log n + cn, a continuous time analogue of Theorem p.l.3| (which 
we have confirmed) asserts that there exists a universal constant a' > such that 

\\Rt - Us„ II TV < a'e'"^" for all n > 1 and all c> 0, 
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where Us^ is the uniform distribution on 5'„ defined by ( p.l.2|) . Therefore, 

hmsup ||gt„ -t/||Tv < (a' + 2)e-^ 

n >oo 

from which the desired resuh follows. □ 

The following table summarizes the number of steps (both necessary and sufficient) for the 
distance (both normalized £^ and total variation) to uniformity to become small for various 
special cases of the paired shuffles random walk analyzed in this section. 
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Random walk on G I Sn 
(with paired randomizations) 



G 



metric 



nec. or suff. 



number of steps 



proof 



sufficient 



nlogn 



Thm. 



necessary 



log n 



pf. of Thm. ^J7^ 



TV 



sufficient 



|n logn {n oo) 



Thm. ^7:61 



necessary 



in log n 



Thm. 2.7.3 



sufficient 



n log n + n log(m — 1) 



Thm. KTl: 



necessary 



|nlogn + 7nlog(m — 1^ 



pf. of Thm. KT^ 



TV 



sufficient 



in log n (n — ^ oo) 



Thm. B.7.6 



necessary 



^nlogn 



Thm. 2.7.3 



Srr 



max I in log 5„, nlogn 



sufficient 



+inlog(|m!| 
+i?T,log(p(m) 



Thm. 3.7.4 



1^ 



necessary 



max I in log 5„, ^nlogn 
+inlog(|m!| — 1 



pf. of Thm. p. 7.4 



TV 



sufficient 



in log n (n oo) 



Thm. B.7.6 



necessary 



in log n 



Thm. UTL^ 



G 

abelian 



sufficient 



nlogn + nlogdGI — 1) 



Thm. 3.7.4 



necessary 



inlogn + inlog(|G| - 1) 



pf. of Thm. 3.7.4 



TV 



sufficient 



in logn (n oo) 



Thm. R.7.6 



necessary 



in log n 



Thm. UTL^ 



G 

nonabelian 



sufficient 



max I in log 6n , n log n 
+inlog(|G|-l) 
+inlog(g - 1)| 



Thm. 3.7.4 



necessary 



max I in log inlogn 
+inlog(|G|-l)" 



pf. of Thm. p. 7.4 



TV 



sufficient 



in logn (n oo) 



Thm. 3.7.6 



necessary 



in log n 



Thm. UTL^ 
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